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Abstract. If the equation l k + 2 k ^ h(m — 2) fe + (m-l) fc = m k has a solution 

with k > 2, then m > 10 10 . Leo Moser showed this in 1953 by remarkably 
elementary methods. His proof rests on four identities he derives separately. It 
is shown here that Moser's result can be derived from a von Staudt-Clausen type 
theorem (an easy proof of which is also presented here). In this approach the 
four identities can be derived uniformly. The mathematical arguments used in the 
proofs were already available during the lifetime of Lagrange (1736-1813). 

1. Introduction 

Consider the Diophantine equation 

l k + 2 k + ■ ■ ■ + (m - 2) k + (m - l) k = m k , (1) 

to be solved in integers (m,k) with m > 2 and k > 1. Note that in case k = 1 
the left-hand side of <^ equals m{m — l)/2, and this leads to the (unique) solution 
1 + 2 = 3. From now on we will assume that k > 2. Conjecturally solutions with 
k > 2 do not exist (this conjecture was formulated around 1950 by Paul Erdos in a 
letter to Leo Moser). Leo Moser [TD] established the following theorem in 1953. 

Theorem 1. (Leo Moser, 1953). If (m,k) is a solution of (pQ) with k > 2, then 
m > 10 106 . 

His result has since then been improved upon. Butske et al. [2] have shown, by 
computing rather than estimating certain quantities in Moser's original proof, that 
m > 1.485- io 9321155 . By proceeding along these lines this bound cannot be improved 
upon substantially. Butske et al. [2, p. 411] expressed the hope that new insights 
will eventually make it possible to reach the more natural benchmark 10 10 . This 
hope was recently fulfilled by Gallot, the author, and Zudilin [4], who showed that 
2k /(2m — 3) must be a convergent of log 2 and made an extensive continued fraction 
computation of (log2)/2iV, with iV an appropriate integer, in order to establish 
Theorem [2J Note that their result goes well beyond establishing the benchmark. 
Their approach only works for those N for which it can be shown that N\k. In [9] 
it was, e.g., shown that lcm(l, 2, . . . , 200) \k. 



Theorem 2. If (m, k) is a solution of ([T]) with k > 2, then m > 10 
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Moser's proof of Theorem [T] is quite amazing in the sense that he uses only very 
elementary number theory. His proof is even mathemagical in the sense that he 
pulls four rabbits out of a hat, namely the equations ([7]), (JTO]) . ([12]) . and ([TBI , that 
a solution (m, k) has to satisfy. He derives each of these equations separately in a 

l 



2 



PIETER MOREE 



quite ingenious way. In this note we will see that a reproof of Moser's result can be 
given, showing that the following result is the top hat the four rabbits were pulled 
from. 

Put5 r (y) = ESj r - 
Theorem 3. (Carlitz-von Staudt, 1961). Let r and y be positive integers. Then 

{- E(p-i)|r, P \y p ( mod V) otherwise. 

The latter sum is over the primes p such that both p — 1 divides r and p divides y. 
(Here and in the sequel the letter p is used to indicate primes.) Using this result, 
an easy proof of which will be given in Section [31 a less mathemagical reproof of 
Moser's result can be given. For a polished version of Moser's original proof, we 
refer the reader to the extended version of this note [8]. 

The prime harmonic sum diverges (as Euler already knew) and so given a > 1/2 
there exists a largest prime p(a) such that J2 p < P ( a ) Vp < a - Moser needed p(3.16) 
in his proof, but could only estimate it using certain prime number estimates. His 
proof is easily adapted to involve p(3|) and this was first exactly computed by 
Butske et al. [2] , leading to an improvement of Moser's bound, namely 

m> ( 3 n pf > l48s -io 9321155 . (3) 

p<p(H) 

We obtain, using Theorem [3] and a computer algebra package like PARI to com- 
pute p(3|) = 85861889 and the prime product in ([3]), the following variant of Moser's 
result. 

Theorem 4. Suppose that (m, k) is a solution of ([T|) with k > 2. Then 

1) m > 1.485 • 10 9321155 ; 

2) k is even, m = 3 (mod 8), m = ±1 (mod 3); 

3) m — 1, (m+ l)/2, 2m — 1, and 2m + 1 are all squarefree; 

4) if p divides at least one of the above four integers, then (p — l)\k; 

5) the number (m 2 — l)(4m 2 — 1)/12 is squarefree and has at least 4990906 prime 
factors. 

The proof we give in this note shows that if Lagrange (1736-1813) had a present- 
day computer, he could have proven Theorem HI 

In order to improve on Theorem[2]by Moser's approach one needs to find additional 
rabbit (s) in the top hat. The interested reader is wished good luck in finding these 
elusive animals ! 

2. Proof of Theorem @] 

Proof of Theorem® We will apply Theorem |3] with r = k. 

In case k is odd, we find, on combining (T5]) (putting y = m) with ([1]) and using 
the coprimality of m and m — 1, that m = 2 or m = 3, but these cases are easily 
excluded. (Since l k + 2 k < (1 + 2) k for k > 1, one sees that 1 + 2 k = 3 k has only 
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the solution k = 1.) Therefore k must be even. 

Take y = m — 1. Then using (OQ), the left-hand side of 02]) simplifies to 

S k (m - 1) = l k + 2 k + • • • + (m - 2) fc = m fe - (m - l) fc = 1 (mod m - 1). (4) 

We get from P) and (JU) that 

Em — 1 , _ . . . 
h 1 = (mod m - 1). (5) 

p|(m-l), (p-i)|A P 

Suppose there exists p|(m — 1) such that (p — 1) { fc. Then on reducing both sides 
of (jSJ) modulo p we get 1 = (modp). This contradiction shows that in (jSj) the 
condition (p — l)\k can be dropped, and thus we obtain 

Em — 1 . 
+ 1 = (modm-1). (6) 

p|(m-l) ^ 

Suppose there exists a prime p dividing m — 1 such that p 2 also divides m—1. Then 
on reducing both sides modulo p, we get 1 = (mod p). This contradiction shows 
that m — 1 must be squarefree. On dividing |6| by m - 1 we obtain 

E - + A eZ ' (7) 
A — ' p m—1 

p|(m-l) 

Take y = m. Then using ([[]) and 2|/c we infer from ()2]) that 

(p— l)|fc, p|m 

Since a sum of reciprocals of distinct primes can never be a positive integer, we infer 
that the sum in (jSJ) equals zero and hence conclude that if (p — 1) | k, then p\m. We 
conclude for example that (6,m) = 1. Now on considering (JTj) with modulus 4 we 
see that m = 3 (mod 8). 

Take y = m + 1. Then using ([T]) and the fact that /c is even, the left-hand side of 
(J2J) simplifies to 

Sfc(m + 1) = S'fe(m) + m fc = 2(m + 1 - l) fc = 2 (mod m + 1). 

We obtain 

V ^ + 2 = 0(modm + l), (9) 
^— ' p 

p|(m+l), (p-l)|fc ^ 

but by reasoning as in the case y — m — 1, it is seen that p| (m + 1) implies (p — 1) |/c 
and thus 

> - + r e z. 10 

^ p m + l 

From ((9]) and m = 3 (mod 8), we derive that (m + l)/2 is squarefree. 
Take y = 2m — 1. On noting that 

■m—1 

S k (2m - 1) = J^(j fc + (2m - 1 - j) k ) = 2S k {m) = 2m fc (mod 2m - 1), 
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we infer that 

+ 2m fc = (mod 2m- 1). (11) 

p|(2m-l), (p-l)|fc ^ 

Since m and 2m — 1 are coprime we infer that if p|(2m — 1), then (p — l)\k and 
m k ee 1 (modp), and furthermore that 2m — 1 is squarefree. By the Chinese 
remainder theorem it then follows that 2m fc ee 2 (mod 2m — 1), and hence from ( ITT]) 
we obtain 

z — ' p 2m — 1 

p|(2m-l) 

Take y = 2m + 1. On noting that 

m 

S fe (2m + 1) = E^' fc + (2m + 1 - = 2S fe (m + 1) ee Am k (mod 2m + 1) 
and proceeding as in the case y = 2m — 1 we obtain 

V - + — - — ez. (13) 

^ p 2m + 1 v 7 

p](2m+l) 

We further see that 2m + 1 is squarefree. 

No prime p > 3 can divide more than one of the integers m — 1, m + 1, 2m — 1, 
and 2m + 1. Further, since m ee 3 (mod 8) and 3 \ m, 2 and 3 divide precisely 
two of these integers. We infer that M = (m — l)(m + l)(2m — l)(2m + 1)/12 is a 
squarefree integer. On adding ([7]), (TlOl) . (JT2J), and ( TT3|) . we deduce that 

1 1 2 2 4 1 1 1 

> - + + + + >4 = 3-. 14 

^ p m - 1 m + 1 2m -1 2m+l~ 2 3 6 v; 

p|M r 

One checks that the only solutions of (j7|) with m < 1000 are 3, 7, and 43. These are 
easily ruled out by (fTO]) . Thus (TTTI) yields (with a = 3.16) z\^ p \ M - > a. From this 
it follows that if 



E 1 

n 



< a, (15) 

— - r> 

p<x 

then m 4 /3 > M > fL^P an< ^ nence 



m > S 1 /^/ 4 , (16) 

with 9(x) = z~2 p < x ^°SPj the Chebyshev ^-function. Since for example flT~5|) is satisfied 
with x = 1000, we find that m > 10 103 and infer from ( TT4"1) that we can take 
a = 3| — 10 -100 in (fT51) . Next one computes (using a computer algebra package) the 
largest prime such that z^2 Pj < Pk f~ < 3§, with pi,P2, ■ ■ ■ the consecutive primes 
(note that p^ = p(3|)). Here one finds that k = 4990906 and 

4990906 ^ j 

V - = 3.1666666588101728584 < 3- - 10~ 9 . 
^— ' Pi 6 
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By direct computation one finds that 6{p k ) = 8.58510010694053 •• • x 10 7 . Using 
this we infer from (I16p the inequality ([3]), and hence part 1 of the theorem is proved. 

Notice that along our way towards proving part 1, the remaining parts of the 
theorem have also been proved. □ 



3. Proof of the Carlitz-von Staudt theorem 

Carlitz [3 J gave a proof of Theorem [3] using finite differences and stated that the 
result is due to von Staudt. When r is odd, he claims that S r (y)/y is an integer, 
which is not always true (it is true though that 2S r (y)/y is always an integer). The 
author [6J gave a reproof using the theory of primitive roots and Kellner [5] a reproof 
(for r even only) using Stirling numbers of the second kind. Here a reproof will be 
given that is easier than all the above. It uses only the following result of Lagrange. 

Theorem 5. If f is a one-variable polynomial of degree n over TLjpTL, then it cannot 
have more than n roots unless it is identically zero. 

Proof. See, e.g., the book of Rose [HI Theorem 2.2, p. 39]. □ 

Lemma 1. Suppose that (p — 1) \ r. Then the equation x r ^ 1 (mod p) has a 
solution. 

Proof. Let T\ be the smallest positive integer such that r\ = r (mod p — 1). Then 
r\ < p — 1. Suppose that x r = 1 (mod p) for every x G {1, 2, ... ,p — 1}. Then by 
Fermat's little theorem we also have x ri = 1 (mod p) for every x e {1, 2, . . . , p— 1}, 
contradicting Lagrange's theorem. □ 

Lemma 2. Let p be a prime. We have 

S r (p) = e r (p) (mod p), 

where 

-1 if (p — l)|r; 



e r (p) 



otherwise. 



Proof. If p — 1 divides r the result follows by Fermat's little theorem. If (p— 1) \ r, as- 
sume that S r (p) ^ (mod p). Let a be an integer not divisible by p. Multiplication 
by a permutes the elements of Z/pZ and hence S r (p) = a r S r (p) (mod p), from which 
we infer that a r = 1 (mod p). Thus a r = 1 (mod p) for a = 1, 2, . . . ,p — 1. Invoking 
Lemma [TJ gives a contradiction, and hence our assumption that S r (p) ^ (mod p) 
must have been false. □ 

The usual proof of this result makes use of the existence of a primitive root modulo 
p, which provides a solution to x r ^ 1 (mod p) in case (p — 1) \ r. The proof given 
here only makes use of the more elementary theorem of Lagrange, Theorem [5j 



Lemma 3. Ifp is odd orp = 2 andr is even, we have S r (p x+1 ) = pS r (p x ) (mod p 



6 PIETER MOREE 

Proof. Every j with < j < p x+1 can be uniquely written as j = ap x + with 
< a < p and < /3 < p x . Hence we obtain on invoking the binomial theorem that 

p—i p A — i p A — i p—i p A — i 

s r (p x+i ) = E E ( a p x + w = p E ^ r + r p x E « E r_1 ( mod ^ 2A )- 

«=0 /3=0 /3=0 o=0 /3=0 

Since the first sum equals S r (p x ) and 2 a = p{p — 1) = (mod p), the result 
follows. □ 

Proof of Theorem |3j First let us consider the case where r is odd. We proceed by 
induction on y. Assume S r (m) is a multiple of m(m — l)/2. We need to show that 
S r (m + 1) = S r (m) + m r is a multiple of m(m + l)/2. 
If m is even, we have that m/2 divides S r (m). But 

777 777 

S r (m + 1) = (r + rrO + (2 r + (m - + ■ ■ • + ((-) 1 - + (- + l) r ), 

which is a multiple of m + 1 as each pair above is. Thus, S r (m + 1) is a multiple 
of m/2 as well as of m + 1. Since these are coprime S r (m + 1) is a multiple of 
m(m + l)/2. 

If m is odd, then m\S r (m). But 

777 -)- 1 

S r (m + 1) = (V + 7777) + (2'- + (m-in + --- + (^— ) r , 

which is a multiple of (m + l)/2 as each term is. Thus S r (m + 1) is a multiple of 
both m and (m + l)/2, which are coprime, and hence it is a multiple of m(m + l)/2. 
Next we consider the case where r is even. Suppose that p*\y. Then 

p/ p J — 1 

^(y) = E E ("p 7 + s ^^(^) ( mod p 1 )- 

o=0 /3=0 P 

By the Chinese remainder theorem it is enough to show that 

y 

S r (y) = -e r (p) (modp ep ), 
P 

where y = Yl P P ep ^ s a factorization of y into prime powers p £p . By f lT7|) . Lemma [31 
and Lemma El we then infer that 

S r (y) = ^S r {p e *) = V -S r {p) = V -e r (p) (mod p e ^), 
p e p p p 

thus concluding the proof. □ 

4. Concluding remarks 

A further application of the Carlitz-von Staudt theorem is to show that Giuga's 
conjecture (1950) and Agoh's conjecture (1990) are equivalent; see Kellner [5]. 
Giuga's conjecture states that if n > 2, then S' n _i(n) = —1 (mod n) if and only 
if n is prime. Agoh's conjecture states that if n > 2, then ni? n _i = — 1 (mod n) if 
and only if n is prime, where B r denotes the rth Bernoulli number. 
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The author has generalized the Carlitz-von Staudt theorem to deal with consecu- 
tive rth powers in arithmetic progression; see [5]. However, the method of proof in 
case r is odd given in Section [3] no longer applies in this more general situation. 

That Theorem [3] can be used to reprove Moser's result was first observed by the 
author in [7], where it played a key role in the study of the more general equation 
l k + 2 k + ■ • • + (m — l) fc = am k . The presentation given here also draws on computer 
improvements since 1996 and [2]. The proof of Theorem [3] given here is clearly easier 
than those given in [3J El [6], and is the main new contribution in this note. 

Some variants of the Erdos-Moser problem require computing p(a) for a > 3|; 
see, e.g., [7] The largest value for which p(a) has been computed is a = 4. Bach 
et al. [1] found that p(4) = 1801241230056600467, but whereas the computation of 
p(3|) is straightforward with a computer algebra package, computation of p(4) is 
rather more involved (using the Meissel-Lehmer algorithm). For a > 4, one presently 
has to resort to deriving a sharp lower bound for p(a) and here one is forced, as was 
Moser, to use prime number estimates; cf. [7]. 

Acknowledgement. The argument for Theorem [3] in case r is odd was suggested 
to me by B. Sury and the proof of Lemma [2] by D. Zagier. I would like to thank 
W. Moree, J. Sondow, and the referee for comments on an earlier version. 
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